Gene Function Prediction via Discriminative Graph Embedding
نویسندگان
چکیده
Gene function has been a subject of interest but it is far from fully understood. It is known that some genes have certain functions but it is not clear whether those are all the functions they have. It is a recent trend to use different means to predict gene functions; one of them is to use computational methods on large data sets. Different types of information are used in computational methods, each coming with different hypotheses and applicable to different part of genomes. We regard this problem as a prediction problem on networks. The idea is to use gene network information with a hypothesis that a link in the network means a highly probable similarity in gene functions. This is formulated as a node classification problem on graph with smooth label assignment with respect to the graph structure. We propose a method to embed the graph into a Euclidean space considering label information. The embedding is supposed to trade off the graph structure preservation to have a good discriminative ability. Then, label inference can be done via many machine learning techniques, in which we formulate a large margin spectral transform, known for its robustness. We apply the method to predict gene functions on gene network [1] and protein-protein interactions network [2]. Our method gives higher prediction ability than traditional ways of embedding graphs.
منابع مشابه
Link Prediction using Network Embedding based on Global Similarity
Background: The link prediction issue is one of the most widely used problems in complex network analysis. Link prediction requires knowing the background of previous link connections and combining them with available information. The link prediction local approaches with node structure objectives are fast in case of speed but are not accurate enough. On the other hand, the global link predicti...
متن کاملScaling up Structured Multi-Label Prediction using Discriminative Mean Field Networks
Multi-label classification is an important task in many modern machine learning applications. Accurate methods model the correlations and relationships between labels, either by assuming a low-dimensional embedding of the labels or a graph structure of label dependencies. While such interactions can be achieved using feed-forward predictors, problems with tight coupling between labels are bette...
متن کاملDeep Feature Learning via Structured Graph Laplacian Embedding for Person Re-Identification
Learning the distance metric between pairs of examples is of great importance for visual recognition, especially for person re-identification (Re-Id). Recently, the contrastive and triplet loss are proposed to enhance the discriminative power of the deeply learned features, and have achieved remarkable success. As can be seen, either the contrastive or triplet loss is just one special case of t...
متن کاملSemi-supervised Graph Embedding Approach to Dynamic Link Prediction
We propose a simple discrete time semi–supervised graph embedding approach to link prediction in dynamic networks. The learned embedding reflects information from both the temporal and cross–sectional network structures, which is performed by defining the loss function as a weighted sum of the supervised loss from past dynamics and the unsupervised loss of predicting the neighborhood context in...
متن کاملDiscriminative prototype selection methods for graph embedding
Graphs possess a strong representational power for many types of patterns. However, a main limitation in their use for pattern analysis derives from their difficult mathematical treatment. One way of circumventing this problem is that of transforming the graphs into a vector space by means of graph embedding. Such an embedding can be conveniently obtained by using a set of ‘‘prototype’’ graphs ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009